Method for testing NLP models with text adversarial examples
Annotation
At present, the interpretability of Natural Language Processing (NLP) models is unsatisfactory due to the imperfection of the scientific and methodological apparatus for describing the functioning of both individual elements and models as a whole. One of the problems associated with poor interpretability is the low reliability of the functioning of neural networks that process natural language texts. Small perturbations in text data are known to affect the stability of neural networks. The paper presents a method for testing NLP models for the threat of evasion attacks. The method includes the following text adversarial examples generations: random text modification and modification generation network. Random text modification is made using homoglyphs, rearranging text, adding invisible characters and removing characters randomly. The modification generation network is based on a generative adversarial architecture of neural networks. The conducted experiments demonstrated the effectiveness of the testing method based on the network for generating text adversarial examples. The advantage of the developed method is, firstly, in the possibility of generating more natural and diverse adversarial examples, which have less restrictions, and, secondly, that multiple requests to the model under test are not required. This may be applicable in more complex test scenarios where interaction with the model is limited. The experiments showed that the developed method allowed achieving a relatively better balance of effectiveness and stealth of textual adversarial examples (e.g. GigaChat and YaGPT models tested). The results of the work showed the need to test for defects and vulnerabilities that can be exploited by attackers in order to reduce the quality of the functioning of NLP models. This indicates a lot of potential in terms of ensuring the reliability of machine learning models. A promising direction is the problem of restoring the level of security (confidentiality, availability and integrity) of NLP models.
Keywords
Постоянный URL
Articles in current issue
- Development of adaptive laser head for compensating error of beam waist position during processing materials using laser beam spot detection method
- Investigation of changes in the sensitivity of a fiber Bragg grating to temperature and strain using coatings from low-melting metal
- Cross-polarization coupling in polarization maintaining fiber induced by periodic mechanical stress
- Lyapunov function search method for analysis of nonlinear systems stability using genetic algorithm
- Robust disturbances compensation for MIMO linear systems with unmeasured state vector and control delay
- Trajectory tracking control for mobile robots with adaptive gain
- Switching the electrical properties of thin-film memristive elements based on GeTe by sequences of ultrashort laser pulses
- Spectral and kinetic characteristics of ultrathin cadmium selenide nanoscrolls
- Method for optimization of camera installation parameters for video monitoring of arbitrary surveillance zone
- The use of anthropometric points to introduce restrictions into the synthesis of a 3D model of the human body using SMPL
- A new efficient adaptive rood pattern search motion estimation algorithm
- Clustering in big data analytics: a systematic review and comparative analysis (review article)
- Segmentation of word gestures in sign language video
- A method for constructing interpretable hidden Markov models for the task of identifying binding cores in sequences
- Job scheduling in a distributed computing system on a chip with power consumption minimization
- System for customers’ routing based on their emotional state and age in public services systems
- Sedentary behavior health outcomes and identifying the uncertain behavior patterns in adult
- Confidence Lipschitz classifiers: an instrument of guaranteed reliability
- Visual programming environment for multidimensional fuzzy interval-logic regulators
- Solving the problem of spatial rotation of 3D surfaces and their mapping on the plane
- Analytical and simulation modeling of flexible joints for mechatronic and robotic systems
- Study of heat and mass transfer processes in the Fe-Sn reaction crucible in the presence of high-density electric current
- Measurement of the refractive index using an autocollimation goniometer